AIbase

# Multi-image and video understanding

InternVL3 38B Instruct
Apache-2.0
InternVL3-38B-Instruct is an advanced multimodal large language model (MLLM) that demonstrates exceptional multimodal perception and reasoning capabilities, supporting various tasks such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.
Text-to-Image Transformers Other
OpenGVLab
468
3
© 2025 AIbase